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MOLECULAR MARKERS FOR DIAGNOSING 
HEPATOCELLULAR CARCINOMA 

Reference to Government Grant 

This invention was made in the course of research sponsored 
by the National Institutes of Health grants CA48656 and CA66971. The 
U.S. Government has certain rights in the invention. 

Background of the Invention 

Primary hepatocellular carcinoma (HCC) is one of the most 
common tumors seen in certain areas of the world. In Asia and sub- 
Saharan Africa it has an annual incidence rate of 500 cases per 100,000 
population. In the United States and Europe, HCG accounts for 1 to 2 
percent of tumors seen at autopsy (Podolsky, D K and K.J. Isselbacher. 
1994. Harrison's Principles of Internal Medicine, pp. 1496-1497). There are 
risk factors for HCC, however, that can lead to a large increase in the 
likelihood that tumors will develop: For example, HCC is usually associated 
with a cirrhotic liver, making alcoholics more likely to develop these tumors. 

The increased incidence of HCC in Asian and African 
populations and elsewhere has been attributed to the high incidence of 
chronic infection with hepatitis B virus (HBV) and hepatitis C virus (HCV). 
These chronic infections can lead to hepatitis and cirrhosis which are the 
most common risk factors for HCC. The link between HBV infection and 
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HCC is well established. Studies in Asia have shown that the incidence of 
this form of cancer over time is increased 100-fold in individuals with 
evidence of HBV infection as compared to non-infected controls (Podolsky, 
D.K. and K.J. Isselbacher. 1994. Harrison's Principles of Internal Medicine, 
pp. 1496-1497). More recent work in Europe and Japan has shown that 
HCV is also linked to an Increased risk of HCC. In fact, any agent or factor 
that contributes to chronic, low-grade liver cell damage would make liver 
cell DNA more susceptible to damage and genetic alterations which can 
lead to carcinogenesis. The mechanisms and steps responsible for the 
development of HCC, however, have not been fully elucidated. 

The finding that HBV makes a genetic contribution to the 
development of HCC (Seegerefa/. 1991. J. V/ro/. 65:1673-1679) suggests 
that one or more virus encoded proteins may play a role in 
hepatoearcinogenesis. Other data suggests that hepatitis B x antigen 
(HBxAg) contributes to the pathogenesis of HCC- HBxAg transforms a 
mouse hepatocyte cell line both in vitro and in vivo (Hohne, M. et ai. 1990. 
EMBO J. 9:1137-1145; Seifer, M. et al. 1991. J. Hepatol. 13:S61-S65). 
HBxAg binds to and functionally inactivates the tumor suppressor p53 
(Feitelson, MA. et al. 1993. Oncogene 8:1109-1117; Wang, X.W. et al. 
1994. Proc. Natl. Acad, Set. USA 91:2230-2234; Truant, R. et al. 1995. j. 
Virol. 69:1851-1859; Takeda, S et al. 1995. J. Cancer Res Clin. Oncol. 
121:593-601). HBxAg/p53 staining and complex formation has also been 
shown to correlate with the development of liver tumors in a X transgenic 
mouse model with sustained high levels of HBxAg expression (Kim, CM. 
et al. 1991. Nature 351 :317-320; Koike, K. et al. 1994. Hepatology 1 9:81 0- 
819; Ueda, H. et al. 1995. Nature Genetics 9:41-47). 

It has previously been shown that HBxAg is a trans-activating 
protein (Twu, J.S. and R.H. Schloemer. 1987. J. Virol. 61:3448-3453; 
Rossner, M.T 1992. J. Med. Virol. 36:101-117; Henkler, F. and R. Koshy 
1996. J. Viral Hepatitis 3:109-121). Even though virus DNA fragments 
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integrated into HCC cells often contain the X region (Matsubara, K and T. 
Tokino. 1990, Mol. Biol. Med. 7:243-260; Unsal, H. et al. 1994 Proc, Natl. 
Acad. Sci. USA 91:822-826) and HBxAg made from these integrated 
sequences has transactivating-activity, it is not clear that this action is 
responsible for transformation (Luber, B.L. et al. 1996. Oncogene 12:1597- 
1608). A variety of studies have described differences in gene expression 
which distinguish tumor (HCC) form nontumor (liver) cells (Begum, N. A., 
M. Mori, T. Matsurriaia, K. Takenaka, K, Sugimachi, and G. F. Barnard 
1995. Hepatology 22:1447-1455; Darabi, A.. S. Gross, M. Watabe, M. 
Malafa, and K. Watabe. 1995. Cancer Lett. 95:153-159; Inui, Y., S. 
Higashlyama, S. Kawata, S. Tamura, J.-l. Miyagawa, N. Taniguchi, and Y. 
Matsuzawa. 1994. Gastroenterology 107:1799-1804; Kim, S, O., J. G. 
Park, and Y. I. Lee. 1996. Cancer Res. 56:3831-3836; Ohmachi, Y-, A. 
Murata, T. Yasuda, K. Kitagawa, S; Yamamoto, M. Monden, T. Mori, N. 
Matsuura, and K. Matsubara. 1994. J. Hepatol. 21:1012-1016; Su, W., J. 
F. Liu, S. X. Zhang, D, F Li, and J. J. Yang. 1994. Hepatology 19:788-799, 
Uekt, T., J Fujimoto, T. Suxukt, H. Yariiamoto, and E. Okamoto. 1997. 
Hepatology 25:862-866; Yamashita, N., H Ishibashi, K. Hayashida, J. 
Kudo, K. Takenaka, K. Itch, and Y. Niho. 1996. Hepatology 24: 1437-1 440; 
Zhou, M. X:, M. Watabe, and K. Watabe. 1994. Arch. Virol. 134:369-378). 
However, no indication has been given whether any of these genes are 
turned on or off by HBxAg. 

One of the problems associated with any type of cancer is 
ensuring early detection and risk factor screening so that disease can be 
more successfully treated. Detection of HCC may escape clinical 
recognition because of the presence of other active disease processes, 
such as hepatitis or cirrhosis. One screening tool has been alpha 
fetoprotein levels, where levels greater than 500 ug/L are found in 70-80% 
of patients with HCC (Podolsky, D.K. and K.J. Isselbacher. 1994. Harrison's 
Principles of Internal Medicine, pp. 1496-1497). The most common 
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diagnostic tools are imaging with ultrasound, which can only detect the 
presence of visible tumors, and liver biopsy. Neither of these diagnostic 
tools is able to screen individuals for the risk of disease before tumors 
develop. In biopsy, it can be difficult to distinguish large cirrhotic nodules 
from well-differentiated HCC or low-grade dysplastic nodules from HCC. 

Clearly, there is a need for better methods of early diagnosis, 
as well as risk screening. Criteria for judging the usefulness of HCC 
screening methods were recently reviewed by Collier and Sherman, 1 988. 
Hepatology 27:273-278. 

Summary of the invention 

The invention is a method for detecting hepatocellular 
carcinoma in liver tissue of a patient. A liver tissue sample is obtained from 
the patient, and the level of expression of one or more marker genes in the 
sample is assessed. The marker genes are differentially expressed in 
HBxAg[+] cells as compared with HBxAg[-] cells. A reduction in the level 
of expression of one or more marker genes in the sample as compared to 
the expression level in noncancerous liver tissue is indicative of 
hepatocellular carcinoma in the sample. 

According to an embodiment of the invention, the marker 

gene is selected from the group of genes expressing RNA transcripts which 
hybridize under conditions of high stringency to a nucleic acid probe 
selected from the group consisting of SEQ ID NO:1 , SEQ ID NO:2. SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ JD 
NO:8, SEQ ID NO:9, and SEQ ID NOMO. More particularly, the marker 
gene is selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID 

NO:27; 

a gene which encodes the polypeptide of SEQ ID 

NO:28; 
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a gene which encodes the polypeptide of SEQ ID 
NO:29; 

a gene which encodes the polypeptide of SEQ ID 
NO:29; 

bu-sui; 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
the human fetal heart gene, the cDNA of which is 
identified as GenBank accession number AA047006. 
An example of high stringency hybridization conditions is 
hybridization at4XSSC at65°C, followed by washing in 0.1XSSC at 65°C 
for one hour. Another example of high stringency hybridization conditions 
is hybridization in 50% formamide, 4XSSC at 55 °C. 

According to one embodiment of the invention, the step of 
assessing the level of expression of the marker gene in the sample 
comprises contacting the sample with one or more probes which detect 
mRNA which is differentially expressed in HBxAg[+] cells as compared with 
HBxAgf-] cells. 

According to another embodiment of the invention, the step 
of assessing the level of expression of the marker gene by in the sample 
comprises assessing the ievel of expression of marker protein encoded by 
one or more marker genes. Detection of marker protein is accomplished 
by contacting the sample with one or more antibodies which bind marker 
proteins. 

According to another embodiment, the invention provides a 
method for diagnosing hepatocellular carcinoma comprising the steps of 
obtaining a liver tissue sample from a patient, and assessing the level of 
expression of one or more marker genes in the tissue sample, which 
marker genes are differentially expressed in HBxAg(+] cells as compared 
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with HBxAgf,] cells. The reduction of detectable expression of one or more 
marker genes in the sample is indicative of hepatocellular carcinoma. 

In yet another embodiment, the invention provides a method 
for .dentifying patterns in gene expression in a biological sample that are 
altered by hepatitis B x antigen comprising the steps of 
obtaining a biological sample; 
contacting said sample with a probe which detects an 

mRNA which is differentially expressed in HBxAgM cells as compared with 
HBxAgf-J cells; and • 

detecting expression of a gene encoding said mRNA 
detected by the probe. 

Alternatively, the steps for identifying alterations in gene 
expression patterns in the biological sample comprise 

contacting said sample with an antibody which detects 
a protein which is differentially expressed in HBxAgf+) cells as compared 
with HBxAgf-] cells; and 

detecting expression of a gene encoding the protein 
detected by the antibody. 



Abbreviations and ri^rifmrn 


A - Abbreviations 


"ABC" 


avidin-biotin-peroxidase complex 


"bp" 


base pair 


"CAT' 


chloramphenicol acetyltransferase 


"ISH" 


in situ hybridization 


"TTP" 


deoxythymidine triphosphate 


"HBxAg" 


hepatitis B x antigen 


"HBsAg" 


hepatitis B surface antigen 


"HBV" 


hepatitis B virus 


"HCC" 


hepatocellular carcinoma 
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"PCR" polymerase chain reaction 
"RV reverse transcriptase 

"SSC" standard saline citrate solution (0.1 5M saline 
containing 0 015M sodium citrate, pH 7) 

B: Definitions 

"Expression" means, with respect to a gene, the realization 
of genetic information encoded in the gene to produce a functional RNA or 
protein. The term is thus used in its broadest sense, unless indicated to 
the contrary, to include either transcription or translation. 

"Expression level", with respect to a gene means a relative 
expression level as determined by comparison with the expression level of 
the gene in noncancerous tissue. An expression level may be "assessed" 
visually in a sample with the aid of a microscope, such as by considering 
the intensity of a stain for protein encoded by the gene of interest, or by 
considering the relative number of stained versus unstained cells in the 
sample. 

"Hybridization" means the Watson-Crick base-pairing of 
essentially complementary nucleotide sequences (polymers of nucleic 
acids) to form a double-stranded molecule. 

"Marker gene" means a gene which is differentially 
expressed in HCC tumor versus non-tumor tissue. 

"Marker protein" means a protein which is encoded by a 

marker gene. 

Detailed Description of the Invention 

The present invention is a method for determining whether 
tissue from a biopsy represents HCC, based on detection of gene 
expression patterns in cells. Studies of tumor and non-tumor pairs from 
patients demonstrate the differential expression of certain genes in tumor 
versus non-tumor tissue The genes are expressed innon-HCC tissue, but 
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expression is substantially reduced or Undetectable in HCC tumor tissue 
Thus, a reduction in expression of one or more of the marker genes in the 
fissue sample is diagnostic forthe presence of HCC in the patient sample 
At the individual cel. .eve., one or more of the marker genes may not be 
expressed or may be characterized by reduced expression .eve., such that 
the expression of the gene in the tissue sample as a whole is reduced The 
.dentification of such molecular markers provides a method for diagnosing 
HCC wrtbout relying on tissue morphology alone. This is the first time that 
molecuter markers associated with chronic HBV infection have been shown 
to be useful in the diagnosis of HCC. 

The diagnostic marker genes were identified by manipulation 
of HepG2X cells. HepG2 is a differentiated eel, line derived from a human 
hepatoblastoma. The cell line HepG2X was generated by infection of 
HepG2 cells by replication defective recombinant retroviruses encoding the 
full length HBxAg polypeptide. HepG2CAT cells were generated in the 
same manner by substituting the bacteria. CAT gene for the HBV X gene 
«n the transfection vector. The HepG2X cells express the HBV X antigen 
(HBxAgf+]) > while HepG2CAT cells dp not (HBxAg[-J). 

Genes whose expression were either turned on or off in the 
presence of the hepatitis B x antigen (HBxAg) in HepG2 cells are identified 
by PGR select eDNA subtraction. Briefly, the method consists of isolating 
Whole eel, RNA from HBxAg [+] and [-] HepG2 cells. Methods and kits for 
performingPCRseiectcDNAsub^^ 

ava„ab,e, e.g., from Ctontech. Palo Alto, CA. The RNA from HepG2X ce,,s 
•8 subtracted from those in HepG2 cells, providing RNAs expressed in 
He P G2X cells but not in HepG2 cells. The RNAs were then reverse 
transcribed into DMA and then PGR amplified using random primers ,n 
order to obtain RNAs expressed in HepG2. but not HepG2X cells the 
opposite subtraction is carried out. These RT/PCR fragments were then 
cloned and either partially or fully sequenced. 
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Accordingly, equivalent amounts of poly(A) + RNA were 
isolated from confluent cultures of HepG2X and HepG2CAT cells and 
subjected to PCR select cDNA subtraction. DNA strands were individually 
sequenced from every clone, and the results for each compared to entries 
in GenBank and other related databases (Table 1 , below). The PCR select 
cDNA subtraction generated gene fragments from up to eight different 
cellular genes that were detected in HepG2X cells but not in HepG2CAT 
cells (L4, L7, L8, L1 1, L12, L15, L16 and L19). Five of these (L7, L8, L12, 
Li 6 and L1 9) had at least 89% homology with fragments of known products 
from GenBank (Table 1). Interestingly, three of the five sequences. (L7, 
L12, and L19) had homology with factors Upregulated in fetal tissues, 
suggesting that they may have some growth regulatory functions. In 
addition, two fragments (C1 and G2) were apparently present in 
HepG2CAT cells but absent in HepG2X cells. Hence, up to ten genes 
were differentially expressed in HepG2X compared to HepG2CAT cells. 
In the case of the transcripts hybridizing to the L fragments, the clones 
represent fragments of genes whose expression is activated in HBxAgf+J 
cells compared to HBxAgf-] cells. In the case of transcripts hybridizing to 
the C fragments, the clones represent genes whose expression is 
suppressed in HBxAg[+J cells compared to HBxAg[-J cells. The fragment 
size given in Table 1 is considered approximate, as size was estimated 
visually from gels. 
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table 1: 



DiffereritiaJly expressed genes in HBxAg[+] and [-] Hep 
G2 cells 



clone insert GenBank Seamh 
size 



(~bp) Match (and Accession #) 



HBxf+r Clones 



L7 
L8 b 



L16 b 

L19 

L4 b 

L11 

L15 



690 
220 



L12 320 



180 

250 

1700 

580 

1580 



human fetal liver cDNA clone (H49417) 
human tubulin-fokJing cofactOr E cDNA 
(U61232) 

human 40S ribosomal protein 
S15A(P48149) 

human myeloblast K1AA0132 gene 

human fetal heart cDNA (AA026758) 

none 

none 

none 



%homology 
in overlap 



95.7% in 440 bp 
100% in 45 bp 



99% in 65 bp 
99% in 152 bp 



HBxf-F Clones 

C2 b 620 human sail (L26247) 
C1 670 none 



The clones represent fragments of genes whose expression is activated 
(L4,L7X8,L11.L12X15Xl6,L19)orsu^^ 

cells. MlJ 

Probes whose sequences share considerable homology with sequences independently 
found in tumor compared to nontumor cells. 



The cDNA fragments obtained from subtraction hybridization (Table 
1) were used as probes for ISH of HepG2X and HepG2CAT cells, to verify 
that the probes obtained from PCR select cDNA subtraction actually 
represented differentially expressed genes in HepG2 compared to HepG2X 
cells. In all cases, the L probes hybridized to HepG2X cells. Little or no 
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signal was observed in HepG2CAT cells. In contrast, the C probes 
demonstrated strong hybridization in HepG2CAT cells, but little or no signal 
in HepG2X cells. Thus, in all cases, the probes obtained from PGR select 
cDNA subtraction actually reflected differences in gene expression 
between HepG2X and HepG2CAT cells. 

The cDNA fragments were either partially (L4, L7, L8. L15, L16 and 
C2) or completely (L1 1, L12, L19 and CI) sequenced. The sequences are 
as follows: 



Table 2: Nucleotide Sequences of Fragments of Differentially 
Expressed cDNA 

cDNA Fragment Fragment Nucleotide Sequence 

L7 SEQ ID NO:1 

L8 SEQIDNO:2 

L12 SEQ ID NG:3 

L16 SEQ ID NO:4 

L19 SEQ ID NO:5 

t4 SEQ ID NO:6 

' L11 SEQ ID NQ:7 

L15 SEQIDNO:8 

C2 SEQ ID NO:9 

C1 SEQIDNO:10 



in order to further study the structure and function of the protein 
encoded by the C2 mRNA, the full length cDNA containing the C2 
sequence was obtained (from HepG2CAT cells) by 5' and 3" rapid 
amplification of cDNA ends (RACE) PCR using the Marathon™ cDNA 
Amplification Kit (Clontech. Palo Alto, CA). Briefly, one 3' and one 5' gene 
specific primers were synthesized. PCR was performed using these 
primers together with an adaptor primer to obtain the 3' or 5* cDNA specific 
products in separate amplification reactions. The products were cloned 
into pT7Blue T (Novagen, Inc., Madison, Wl) and sequenced The 
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appropriate 3' and 5' gene specific fragments were then digested with 
suitable restriction enzymes and cloned into pcDNA3 (Invitrdgen, San 
Diego, CA) at the chosen site(s), and the integrity of the full length clone 
verified by DMA sequencing. This resulted in a full length clone exactly 
1 35 kb in length, which encoded a small protein of 113 amino acids near 
its 5' end that has 100% homology with the human translation initiation 
factor, hu-suil. The G2 probe spans bases 903-1 350 of full length hu-suil 



cDNA 



Other than its regulatory role in translation initiation, the human suit 
protein does not appear to have any recognizable motifs which would 
suggest additional functions. These results Indicate that the introduction 
of HBxAg results in the altered expression of a protein whose function is 
associated with the regulation of translation. Further. HBxAg may 
contribute to hepato-carcinogenesis. in part, by altering gene expression 
at the level of translation initiation. 

Additional full length cDNAs from differentially expressed genes 
containing fragments L4, L7. L11 and L12 were obtained in a similar 
manner to fragment C2. The cDNA containing fragment L12 encoded a 
protem of 130 amino acids having a 100% homology with the human 40S 
nbosomal protein S15A (Accession nos. P39027, P39031). Sequences of 
the full length cDNAs and corresponding gene names are set forth in Table 
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Table 3: Nucleotide Sequences of Full-length Differentially 
Expressed cDNAs 



cDNA 


Full Length 


Gene Name 


GenBank 


Translated 


Fragment 


cDNA Seq. 




Accession 


Protein Seq. 




(SEQ ID NO.) 




Number 


(SEQ ID NO.) 


L4 


11 


unknown 


- 


27 


L7 


12 

i 


human fetal 
liver cDNA 


H49417 


28 


L11 


13 


unknown 




29 


L12 


14 


human 40S 
nbosdmal 
protein 
ST5A 


P48149 


30 


C2 


15 


hu-su/1 


L26247 


31 



Experiments were performed to detect hu-sui1 transcripts in tumor 
and nontumor tissues from HBV infected patients.. A panel of 
tumor/nontumor tissue pairs from a group of patients were analyzed by ISH 
using the C2 probe. Among this group, 14 patients were from South Africa, 
while the remaining 23 were from mainland China, the results (Table 5, 
Exarhple 3, below) show that hu-sui1 mRNA is easily detectable in 
nontumor tissue from both groups, but that it is rarely present in tumor 
tissues from the same patients Thus, hu-suH is differentially expressed 
in tumor vs. non-tumor tissue. 

ISH was performed with the full set often individual probes (L4, L7, 
L8. L11, L12. L15. L16, L19, C1 and C2) on tumor/nontumor paired 
samples from five HBV carriers with HCC. and on normal uninfected liver 
from two individuals. The probes detected transcripts that were 
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preferentially expressed in nontumor, compared to tumor tissues, in most 
cases (Table 6. Example 4. below). These results were not due to 
differences in the ability of the tumor tissue to uptake probe, since tumor 
cells from three of the five HCC patients hybridized strongly to an alpha 
fetoprotein probe. 

For diagnosis of HCC, a sample of liver tissue is removed from an 
individual by conventional biopsy techniques which are well-known to those 
skilled in the art. Typically, the test subject will be an HBV-infected 
individual. The sample is generally collected by needle biopsy. 
Procedures for liver needle biopsy are well-known in medicine. A mass 
may be apparent from either tactile examination of the patient, Or upon 
imaging such as by ultrasound. The needle biopsy should be taken at or 
near the site of the mass. Ultrasound guided percutaneous fine-needle 
biopsy procedures are known. See, e.g., Polakow, 
Hepatogastroenterology 45:1829-30 (1998). The biopsy sample may also 
be taken in connection with a surgical procedure in which the liver becomes 
accessible. 

The expression level of the marker gene may serve as a convenient 
molecular marker to replace or augment conventional liver tissue 
examination, which largely relies on subjective criteria. This form of 
"molecular-based" diagnosis can be performed more consistently than 
conventional pathological examination which is based upon subjective 
evaluations by expert pathologists. 

Detecting the expression of the marker gene in the tissue sample 
comprises detecting RNA transcripts, particularly mRNA transcripts in the 
sample tissue, or detecting the Corresponding marker gene product 
(protein) in the sample tissue. Preferably, the presence of the marker 
protein in the sample tissue is detected by an immunoassay whereby an 
antibody which binds the marker protein is contacted with the sampled 
tissue. 
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Typically, a portion of noncancerous liver tissue will be removed with 
the purported tumor tissue during biopsy. The noncancerous (i.e., non- 
tumor) cells will express the marker gene, and will provide a positive signal 
for the absence of HCC. Hence, the noncancerous cells in the biopsied 
sample will serve as a convenient positive control. 

It may be desirable in some cases to compare the assay results 
against a control sample comprising liver cells from non-tumor liver tissue 
from the test subject, or non-tumor liver tissue from another (HBV-infected) 
individual. The non-tumor sample should test positive for the expression 
of the marker gene. 

Tissue samples may be considered as HCC-positive when the level 
of expression of one or more marker genes is reduced in the tissue sample 
as a whole, compared to the expression level in noncancerous liver tissue. 
The overall reduction of marker gene expression in the liver sample may 
arise from a reduced but still detectable expression level in at least a 
portion of the cells of the sample, a complete loss of marker gene 
expression in some cells, or a combination of both. The former may be 
apparent as a general lessening of stain intensity when the sample Is 
treated with a stain for cells which express the mairker gene. The latter 
may be observed as a complete absence of stain in the affected cells. 
These observations can be made visually with the aid of a microscope. 

Methods of detecting rnRNA transcripts of a particular gene in cells 
of a tissue of interest are well-known to those skilled in the art. According 
to one such method, total cellular RNA is purified from the effected cells by 
homogenization in the presence of nucleic acid extraction buffer, followed 
by centrifugation. Nucleic acids are precipitated, and DNA is removed by 
treatment with DNase and precipitation. The RNA molecules are then 
separated by gel electrophoresis on agarose gels according to standard 
techniques, and transferred to nitrocellulose filters by, e.g., the so-called 
"Northern" blotting technique^ The RNA is immobilized on the filters by 
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heat,ng. Detection, end quantification if desired, of specific RNA is 
accomplished using appropriately labeled DNA or RNA probes 
complementary to the RNA in question. See Molecular Coning- A 

t * W M9n " al J ' Sambrook * *• 2nd edition. Cold Spring 
Harbor Laboratory Press. 1 98 9. Chapter 7. the disclosure of which is 
incorporated by reference. 

More preferably, the mRNA assay is carried out according to ISH 
Also known as "cytologica. hybridation", the in situ technique involves 
depositing whole cells or .issues onto a microscope cover slip and probing 
the nucleic acid content of the cell with a solution containing radioactive or 
otherwise labeled cDNA or cRNA probes. The practice of ISH is described 
■n more detail in U.S. Paten, 5.427.916. the entire disclosure of which is 
rncorporated herein by reference. 

The nucleic acid probes for the above RNA hybridization methods 
can be designed based upon the full terrgtt, marker gene sequences 
descnbed or referenced herein. Where the marker gene has yet to be 
Ktentified with a known. hrlMength sequenced DNA, the corresponding 
cDNA fragment listed in Table 2 may be used as the probe. 

Methods for preparation of labeled DNA and RNA probes, and the 
conditions for hybridization thereof to target nucleotide sequences are 
described in Motecu/ar Coning, supra. Chapters 10 and 11, incorporated 
berem by reference. The nucleic acid probe may be labeled wfth e o. a 
radionuclide such as «p, »c. or *S : a heavy mete.; or a ligand capable'o, 
functroning as a specific binding pair member for a labeled ligand. such as 
a labeled antibody, a fluorescent molecule, a chemolesceht molecule, an 
enzyme or the like. 

Probes may be labeled to high specific activity by either the nick 
translation method or Rigbyefa/., J. Mo,. Biol. 1,3: 237-251 (1977) or by 
the random priming method. Fienberg e, a/.. Anal. Bioohem 132- 6-13 
(1983). The latter is the method of choice for synthesizing »P-,abeled 
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probes of high specific activity from single-stranded DNA or from RNA 
templates. Both methods are well-known to those skilled in the art and will 
not be repeated herein. By replacing preexisting nucleotides with highly 
radioactive nucleotides, it is possible to prepare 3 *P-labeled DNA probes 
with a specific activity well in excess of 1 0 8 cpm/micrbgram according to the 
nick translation method. Autoradiographic detection of hybridization may 
then be performed by exposing filters oh photographic film. 

Where radionuclide labeling is not practical, the random-primer 
method may be used to incorporate the TTP analogue 5-(N-(N-biotinyl- 
epsilon-aminocaproyl)-3-aminoallyl)deoxyuridine triphosphate into the 
probe molecule. The thus biotinylated probe oligonucleotide can be 
detected by reaction with biotin binding proteins such as avidin. 
streptavidin, or anti-biotiri antibodies coupled with fluorescent dyes or 
enzymes producing color reactions. 

In situ hybridization is most conveniently carried out using a 
commercially available kit for labeling nucleic acid probes with, e:g. 
digoxigenenin/biotin as a label. One such kit is available from Oncor, 
Gaithersburg, MD. 

According to another embodiment of the invention, marker gene 
expression in ceils of the patient tissue is determined by detecting the 
corresponding marker protein. A variety of methods for detecting and 
quantifying expression of proteins of interest exist, including Western 
blotting and immunohistochemical staining. The latter is preferred. 
Western blots are run by spreading a protein sample on a gel, using an 
SDS gel, blotting the gel with a cellulose nitrate filter, and probing the filters 
with labeled antibodies. With immunohistochemical staining techniques, 
a tissue sample is pirepared, typically by dehydration and fixation, followed 
by reaction with labeled antibodies specific for the desired gene product. 
The antibodies may be coupled to a visually detectable label, such as 
enzymatic labels, flourescent labels, luminescent labels, and the like. 
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According to one embodiment of the invention, tissue samples are 
obtained from patients and the samples are embedded and serially 
sectioned at 3-5 u per section. The sections are fixed, mounted and dried 
according to conventional tissue mounting techniques. The fixing agent 
may advantageously comprise formalin. The embedding agent for 
mounting the specimen may comprise, e.g., paraffin. The samples may be 
stored in this condition. 

Following deparaffinization and rehydration, the samples are 
contacted with an immunoreagent comprising an antibody specific for a 
marker protein of interest. The antibody may comprise a polyclonal or 
monoclonal antibody. The antibody may comprise an intact antibody or 
fragments thereof capable of specifically binding marker protein. Such 
fragments include, but are not limited to, Fab and F(ab') 2 fragments As 
used herein, the term "antibody" includes both polyclonal and monoclonal 
anybodies. The term "antibody" means not only intact antibody molecules 
but also includes fragments thereof which retain antigen binding ability 

Appropriate polyclonal antisera may be prepared by immunizing 
appropriate host animals with marker protein and collecting and purifying 
the antisera according to conventional techniques known to those skilled 
»n the art. Monoclonal antibody may be prepared by following the classical 
technique of Kohler and Milstein. Nature 25*493-497 (1975), as further 
elaborated in later works such as Monoclonal Antibodies, Hybridomas- A 
New Dimension in Biological Analysis, R. H. Kennel ef at., eds. Plenum 
Press, New York and London (1980). 

Substantially pure marker protein for use as an immunogen for 
ra,s,ng polyclonal or monoclonal antibodies may be conveniently prepared 
by recombinant DNA methods. 

As an alternative to immunization with the complete marker protein 
anybody against marker proteins can be raised by immunizing appropriate 
hosts with immunogenic fragments of the whole protein, particularly 
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peptides corresponding to probable antigenic determinants. Hydrophilic 
regions, which face the environment surrounding the protein, are most 
likely to contain antigenic sites. Such regions can be identified using 
standard computer programs. 

The antibody either directly or indirectly bears a detectable label. 
The detectable label may be directly attached to the primary anti-marker 
protein antibody. More conveniently, the detectable label is attached to a 
secondary antibody, e.g., goat anti-rabbit IgG, which binds the primary 
antibody. The label' may advantageously comprise, for example, a 
radionuclide in the case of a radioimmunoassay; a fluorescent moiety in the 
case of an immunofluorescent assay; a chemilumiriescent moiety in the 
case of a chemiluminescent assay; or an enzyme which cleaves a 
chromogenic substrate, in the case of an enzyme-linked immunosorbent 
assay. 

Most preferably, the detectable label comprises an avidin-biotin- 
peroxidase complex (ABC) which has surplus biotin-binding capacity. See 
Hsu et a/., J. Histochem. Cytochem. 29:577-580, 1981: The secondary 
antibody is biotinylated. Kits for staining proteins by the ABC method are 
commercially available (e.g.. Vector Laboratories, Burlingame, CA) 

To determine the presence of marker protein antigen in the tissue 
section under analysis, the section is treated with primary antiserum 
against the antigen, washed, and then treated with the secondary 
antiserum. The subsequent addition of ABC localizes peroxidase at the 
site of the specific antigen, since the ABC adheres non-specifically to 
biotin; Peroxidase (and hence antigen) is detected by incubating the 
section with e.g. H 2 0 2 and diarninobenzidine (which results in the antigenic 
site being stained brown) or H 2 0 2 and 4-chloro-1-naphthol (resulting in a 
blue stain). 
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The ABC method can be used for paraffin-embedded sections. 

frozen sections, and smears. Endogenous (tissue or cell) peroxidase may 

be quenched e.g. with H 2 0 2 in methanol. 

The level of marker protein expression in samples may be compared 

on a relative basis to the expression in non-tumor liver tissue samples by 
comparing the stain intensities, or comparing the number of stained cells. 
The lower the stain intensity in the test sample with respect to nontumor 
controls, or the lower the stained cell count in a tissue section having 
approximately the same number of cells, the lower the expression of the 
marker gene in the sample, which indicates the presence of HCC in the 
sample. If a control is utilized, it advantageously comprises non-tumor 
liver tissue from another HBV-positive individual or non-tumor liver tissue 
from the patient. 

As a further control of the protein expression, one may preincubate 
the immune serum raised against the marker protein antigen with the 
relevant peptide antigen. This should dissipate the signal from the immune 
serum when contacted with healthy or non-tumor cells, confirming that the 
immune serum reagent is indeed specific for the target antigen. 

The diagnostic procedure described herein may take the form of 
detecting the expression ofjust one of the marker genes. Alternatively, a 
mixture of probes (nucleic acid or antibody) targeting different marker 
genes may be utilized. This may be achieved by pooling two or more 
nucleic acid probes in the ease of a nucleic acid hybridization assay, or by 
pooling two or more antisera in the case of a protein assay, By testing for 
the expression of multiple marker genes in this manner, it is expected that 
the sensitivity of the assay will be increased. 

The following nonlimiting examples are provided to better illustrate 
the claimed invention. 
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Example 1 

Preparation of HBxAg[+] and HBxAg[-] Cell Lines 

A. Cell Lines and Culture Conditions 

HepG2 cells, a differentiated cell line derived from a human 
hepatoblastoma (Aden.D.P. et at 1979 Nature 282:615-617; Knowles, 

B. B. et al, 1980. Science 209:497-499), were cultured on type-1 rat tail 
collagen (Becton Dickinson, Franklin Lakes, NJ) coated tissue culture 
dishes or plates. Cells were grown in Earle's MEM supplemented with 1 0% 
heat inactivated fetal calf serum (PCS), 100 uM MEM non-essential amino 
acids, 1 mM sodium pyruvate, as well as standard concentrations of 
penicillin plus streptomycin. The retrovirus packaging cell line PA317 
(Danos, O. 1991. Methods in Molecular Biology, Practical Molecular 
Virology: Vital Vectors for Gene Expression 8:17-27) was also grown on 
plastic dishes in the same medium. 

B. Plasmid Construction 

The retroviral vector plasmid, pSLXCMVneo, was used to clone the 
HBVX gene (Valenzeula, P et al. I960. Animal Virus Genetics, Academic 
Press: New York, pp. 57-70) or the bacterial chloramphenicol 
acetyltransferase (CAT) gene sequences for these studies, as described 
(Duan, L.X. etal. 1995, Human Gene Then 6:561-573). Briefly, pSLXCMV- 
CAT was constructed by inserting a 726 bp Hindlll-BamHI fragment 
containing the CAT gene into the Hpal-Bglll site of the pSLXCMV 
polylinker. PSLXCMV-FLAG-HBx was constructed by inserting a 920 bp 
Mlul-Bglll fragment of FLAG-HBx DNA into the Mlul-Bglll site of the pSLX- 
CMV polylinker. Recombinants were used to transform HB1 01 . Minipreps 
were prepared and the DNA used for sequence analysis. 
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c - Preparation of Recombinant Retroviruses 
and Infection of Her>G9 Crtl* 

Approximately i x 10* PA317 cells/100 mm dish were transfected 
using standard calcium phosphate precipitation using 15 ug of pSLXCMV- 
FLAG-HBx or 15 ug of P SLXCMV-CAT. At 24, 48, and 72 hours after 
transfection, the medium was removed and processed through a 0 45 pm 
filter to remove PA317 cells, and then used immediately for infection of 
HepG2 cells. Five ml of recombinant retrovirus-enriched supernatant (5 x 
1 0 s CFU/ml, as assayed on NIH-3T3 cells) was used to infect 1 x 1 0 6 target 
HepG2 ceils/100 mm dish in the presence of polybrene (8 pg/ml) for 24 
hours. Fresh virus supernatant was added after 24 and again after 48 
hours so that the cells were exposed to virus for a total of 72 hours. All of 
these infections were earned out in log phase cultures. Cells were then 
passaged at 1.2 and selected by incubation in G418 (800 pg/ m |- 
GIBCO/BRL, Grand Island, NY) for 14 days in order to maximize the 
fraction of cells producing HBxAg or CAT. G418 colonies were then 
expanded in normal growth medium and used for analysis. The fourteen 
day selection in G418 had the effect of eliminating most of the uninfected 
ceils. 



°- Pe te*™ of CAT Activity and HBxAg Polvnep tide in Tr^f*^ 

The transfectants (HepG2-CAT and HepG2X) were evaluated as 
follows, 

CAT assays were performed as described by Wang et al (1994 
Proc. Natl. Acad. Sci. USA 91:2230-2234). Briefly. 1 x 10' HepG2-CAT 
cells in a 100 mm dish were lysed by addition of 0.9 ml of 1x report lysis 
buffer (Promega) for 15 minutes and harvested by scraping. Cells were 
pelleted and 180 pi of cell lysate was used for a standard CAT assay After 
incubation with "C-ch.oramphenicol, acetylated forms were separated by 
thm-layer chromatography. Alternatively, lysates prepared from 5 x 10 6 
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HepG2X cells were assayed for the 17 kDa HBxAg by western blotting 
using a mixture well characterized rabbit anti-x peptide antibodies 
(Feitelson, M.A. and M.M.Clayton. 1990. Virology 177:367^371; Feitelson, 
M.A.etal.1990. GasfroenteTO/ogy98:107i-1078). Horseradish peroxidase 
conjugated goat anti-rabbit lg (Accurate, Westbury, NY) and ECL substrate 
(Amersham, Arlington Heights, IL)were used for detection. 

GAT activity was present in HepG2CAT, but not in HepG2X cells. 
HBxAg was present in lysates from HepG2X, but not from HepG2CAT 
cells. Together, these findings show that both of the recombinant 
retroviruses are expressing the expected products in HepG2 cells. 



Example 2 

Identification of Differentially Expressed Genes 
Distinguishing HepG2X from HepG2CAT 

The differences in gene expression which distinguish HepG2X from 
HepG2CAT cells were determined by using a commercially available 
subtraction hybridization approach (the PCR-select cDNA subtraction kit 
from Clontech, Palo Alto, CA). Briefly, whole cell RNA was extracted 
separately from 1 x 1 0 7 HepG2X and ah equal number of HepG2CAT cells, 
and the quality of the extraction was determined by assaying for 18S and 
28S rRNAs by agarose gel electrophoresis and ethidium bromide staining. 
PCR-select cDNA subtraction is reverse transcriptase (RT)/PCR based, 
and enriches for poly A* RNA (isolated using the Qiagen RNeasy total RNA 
kit; QIAGEN, Inc., Chatsworth, CA)from tissue culture cells or tissues. The 
procedure involved ligating adaptors to some of the PCR products and 
conducting two rounds of subtractive hybridization against the PCR 
products from the cells in which the comparison were being made. The 
resulting products were then PCR amplified using primers which matched 
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the sequence of the adaptors (in the CLONTECH Advantage oDMA PCR 
lot). The unique fragments were then eluted from the gels (using the 
QIAGEN gel extraction kit) and oloned into pT7Blue (Moyagen. Madison 
Wl). Positivecloneswarasetectedbybtue-whitephenotype. Recombinant' 
DNAs ware isolated from minipreps of individua. clones, digested by R sa 
I to check insert size, and then both strands individually analyzed by 
sequence analysis. The sequences obtained were then compared to those 
« GenBank using the FASTA command in the GCG software package for 
homology ,o known genes. The results are set forth in Table 1, above. 

Examp le 3 

Detection of Hu-suH Differential Expression in Patient 
Tumor and Non-tumor Tissue by In situ Hybridization 

These experiments were performed to detect Uu-suil transcripts in 
a tumor and nontumor tissues from HBV infected patients. Accordingly a 
pane, of tumor/nontumor tissue pairs from a group of HCC patients was 
analyzed by ISH using the C2 probe. 

The HCC and surrounding nontumor liver tissues were obtained 

from two different sets of HCC naNpnfe th~«u » • - 

nuo patients. The characteristic of the patients 

are set forth in Table 4: 
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Table 4: Characteristics of HCC patients in study 

Patient group African Chinese 

Number tested 14 23 

Race 13 black, 1 Caucasian 23 Chinese 
Gender 14 male. 0 female 17 male, 6 female 

Age range: 14-72 years 31-68 years 

mean: 39 years 48 years 

No. HBsAg* of total tested 8 of 14 tested (57%) 9 of 23 tested (39%) 

No HBeAg* of total tested 2 of 13 tested (15%) 14 of 23 tested (61%) 

No. anti-HBc* of total tested 1 3 of 14 tested (93%) 1 9 of 23 tested (83%) 

No anti-HBc* of total tested 2 of 1 3 tested (1 5%) 1 0 of 23 tested (43%) 

No. anti-HBs* of HBs tested 5 of 6 Ht?sAg [-] cases 6 of 14 HbsAg {-J cases 

in Table 4, HBsAg and HBcAg are, respectively, hepatitis B surface 
antigen and core antigen. HBeAg is hepatitis B e antigen, a proteolytic 
fragment of HBcAg which is secreted as a free polypeptide into the blood 
of patients who replicate virus in the liver. HBeAg has thus been described 
as a surrogate marker for virus replication 

Twenty-three paired tumbr/nontumor samples came from as many 
HBsAg positive Chinese carriers who had undergone surgery for the 
removal of their tumors. Most patients lived in and around XPan, China and 
were treated at the Fourth Military Medical University. Fourteen additional 
paired tumor/nontumor samples from as many patients were obtained from 
South African patients. Half of these were HBV carriers (serum HBsAg 
positive) while the remaining patients, except for one, had evidence of past 
HBV infection (detectable anti-HBs and/or anti-HBc). Formalin fixed, 
paraffin embedded tissues, fresh frozen blocks, and -80 P C snap frozen 
paired liver and tumor samples from individual patients were collected from 
most patients, used for diagnostic purposes, and were then made available 
for these studies. Analogous pieces of uninfected human liver from two 
individuals were available to serve as controls. 

Gene fragment C2 obtained from PCR select cDNA subtraction was 
used as a probe in in situ hybridization using the Oncor ISH and 
digoxigenenin/biotin detection kits according to the instructions provided by 
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.he wto. (Ohcor. GaHhersburg. MD). The results, shown in Table 
5. demonstrate that hu-*, W mRNA is easiiy deteaable in nontumor tissue 
from both groups. but tet tt fe rare|y fc ^ ^ ^ 

same p^nts. For example. « of ,4 Soum African patients^) and 22 
of 23 Chinese pabents (96%) had detectable hu-su/, mRNA by rsH h 
nonn,mor ce„, In contrast. only , South African (7%) and 5 Chinese 
(22*) had detectable hu-su/f mRNA by ISH In tumor tissue. Among ^ 
Cheese patients with detectable hu-su;, in HCC. 3 of ihe 5 had only trace 
amounts of signa. in tes S than ,0% of the tumor cells Nontumor tissue 
=.gna,s were often more intense and more widespread. These patterns 
were observed in both HBsAg positive and negative patients with HCC in 
both ethnicgroups. These results demonsbate that hu-su,', is differentia,^ 
expressed in tumor compared to nontumor tissue. 
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Table 5: Summary of in situ hybridization for C2 probe in tumor 
/nontumor pairs for HCC patients from south Africa and 
China* 
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3 In situ hybridization (ISH) staining is estimated as follows: 0: no signal; 1: 
ISH signal in <10% of cells; 2: ISH signal in 10-25% of cells; 3: ISH signal 
in 25-50% of cells; 4: ISH signal in >50% of cells. 



25 Example 4 

Detection of Differential Expression in Patient 
Tumor and Non-tumor Tissue by In situ Hybridization 



The HCC and surrounding nontumor liver tissues used for analysis 
were obtained from five HBsAg positive Chinese carriers who had 
3 0 undergone surgery for the removal of their tumors. These patients were 
treated at the Fourth Military Medical University, Xi'an, China. Formalin 
fixed, paraffin embedded tissues, fresh frozen blocks, and -80°C snap 
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frozen paired liver and tumor samp.es from individua. patents were 
collected, used for diagnostic purposes, and Were then made available for 

ttresestudies.lnmanycases.tumorwasdissectedfromnpntumorjustpnor 

to snap freezing: Analogous pieces of uninfected human liver from two 

individuals were available to serve as controls. 

In situ hybridization was carried out using probes L4 L7 L8 L1 1 

L12. L15, L16. L19. CI and C2. and the Oncor ISH ' and' 
rhgoxigenenin/biotin detection kits according to the instructions provided by 
the manufacturer (Oncor, GaKhersburg. MD). The results are shown in 
Table 6. The probes detected transcripts that were preferentially 
expressed in nontumcr. compared to tumo, tissues, in most cases These 
results were no. due to deferences in the ability of the tumor tissue to 
uptake probe, since tumor cells from three of the five HCC patients 
hybnd^ed strongly ,o a 320 bp alphafetoprotein probe (data no, shown, 
Hence, the probes that distinguish HepG2X from HepG2CAT cells also 
distingursh tumor from nontumor in carriers with HCC. 
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Table 6: In situ hybridization results from PCR select cDNA 
amplification 
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a y/7 s/fu hybridization (ISH) signals were as follows: ^ : no signal; ± * 1 -1 0% of the cells were 
positive for the corresponding probe; + : 11-25% of the cells were positive; ++ : 26-50% of 
the cells were positive; +++ : >50% of the cells positive. In the great majority of cases, 
HbxAg was observed in nohtumor liver Tumor cells were either faintly positive for HbxAg 
or completely negative 



25 Example 5 

Detection of Differential Expression in Patient 
Tumor and Non-tumor Tissue by Immunostaining 



30 



A. Peptides 

Synthetic peptides that represent probable antigenic determinants 
on each of the differentially expressed proteins were prepared by solid 
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phase peptide synthesis and analyzed by HPLC and amino acid 
composition prior to use. The peptides are identified in Table 7, below. 
The The peptides were coupled by virtue of their free cysteine sulfhydryl 
(either in the peptide sequence or added to the carboxy or amino terminus 
where the native sequence did not contain a cysteine) to keyhole limpet 
hemocyanin (KLH; Sigma) using the coupling agent m-maleimidobenzyol- 
N-hydroxysuccinimide ester (MBS; Pierce) as described by Liu et at.. 
Biochemistry 18:690-697 (1979). 
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Table 7: 



Peptide fragments of differentially expressed proteins 



Gene Peptide Sequence 



C2 C2.1 DDYDKKKLVKAFKKKFAC 
(SEQ ID NO:16> 

C2.2 EHPEYGEVIQLQGDQRKNIC 
(SEQ ID NO: 17) 



Peptide 
Position 
in Protein 

52-69 



75-94 



10 L4 L4A CQKAKDRMERITRKIKDSDAYRRDE 460-484 

(SEQ ID NO: 18) 

L4B PRPRDKRQLLDPPGDLSRC 821-838 
(SEQ ID NO: 19) 



L7 L7A CGVWNQTEPEPAATS 
15 (SEQ ID NO:20) 

L7B HHHGRGYLRMSPLFKC 
(SEQ ID NO:2l) 



12-25 



56-70 



L11 1L11 PCPELACPREEWRLGP 2-17 
20 (SEQ ID NO:22) 

3L11 DPSRSPHSTSSFPRGSSATSCDSR 316-339 
(SEQ ID NO:23) 

4L11 HPPDGSFSTFHDGPQPLEDPC 359-378 
25 (SEQ ID NO:24) 



L12 L12.1 KSINNAEKRGKRC 12-23 
(SEQ ID NO:25) 

L12.2 DHEERRRKHTGGKC 112-124 
3 0 (SEQ ID NO:26) 
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B Antibody P roduction 

For antibody production, 5- to 10-week old female New Zealand 
Whrte rabbits (2 animals/peptide; Hazelton) were bled and then injected 
wrth peptide conjugate as described (Bittle etal, Nature 298: 30-33 1982) 
Dilutions of immune sera were assayed in parallel with preimmune sera in 
sol.d-phase assays in wells (Immunolon 2 Removawell Strips, Dynatech 
Labs) coated With the appropriate (unconjugated) synthetic peptide See 
Fertelson etal., Gastroenterology 98: 1071-1078, 1990 or Feifelson etal. 

^eo-^/24:121-136,1988foradditiona,detan S ofthe solid phaseassay 
design. 

C - lmmUPOhistochemir. a f ^fipfpg 

Antisera generated from two or three peptide antigens of the same 
prote,n (Table 7) are pooled, and used in immunohistochemical staining 
assays as follows. Paired tumor/nontumor tissue samples from HBV- 
associated liver cancer HCC patients comprise the test samples. 

Tissues are fixed in 10% formalin, embedded in paraffin and serially 
sect,oned at 5m per section. Sections are then stained for individual 
drfferent-ally expressed proteins by the avidirvbiotin complex (ABC) method 
(Hsu et aL, J. Histochem Cytochem 29: 577-580) using a kit purchased 
from Vector Laboratories (Burlingame, CA). Staining is detected by 
addrt,on of diaminobendizine (DAB) substrate, and the sections then 
counterstained with Mayer's hematoxylin. The degree of positive reaction 
■s scored from 0 to +++ . The grade 0 indicates no demonstrable antigen 
+ mild. ++ moderate and +++ dark staining. 

D BgSU ttS . C2.1/C2.2 Antihrvrty M ?Ynirr 

Staining with antisera comprising antibodies against the C2 1 and 
C2.2 antigens indicated the presence of the Sui1 protein by brown color in 
the cytoplasm of nontumor celb surrounding the HCC tumor tissue Very 
little staining appeared in the tumor cells of the HCC tissue 
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D Controls 

The specificity of staining for differentially expressed protein was 
demonstrated by the following controls, (a) Several uninfected human liver 
samples, and several tissue types from other organs (spleen, lymph node, 
muscle, nerve, and gall bladder), were tested with immune sera, (b) 
Preimniune and normal rabbit sera were tested with positive liver sections; 
(c) The synthetic peptide(s) used to raise immune sera was tested for 
blocking of staining when preincubated with corresponding antisera prior 
tp staining, (d) Liver powder made from uninfected human livertissues was 
used to absorb the primary antibodies prior to staining. Peptide antisera 
were tested by western blotting with E. colt lysate from bacteria expressing 
the corresponding L or C polypeptide compared to a similar lysate from 
untransfected host cells. The results of each of these control procedures 
supported the specificity of staining; 

All references cited with respect to synthetic, preparative and 
analytical procedures are incorporated herein by reference. All sequence 
records identified by GenBank accession numbers are incorporated herein 
by reference. 

The entire disclosure of U S. provisional patent application Serial Mo. 
60/072,938 filed January 29, 1998, is incorporated herein by reference. 

The present invention may be erhbodied in other specific forms 
without departing from the spirit or essential attributes thereof and, 
accordingly, reference should be made to the appended claims, rather than 
to the foregoing specification, as indication the scope of the invention. 
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What is claimed 

t. Amethdd fordetecting hepatocellular carcinoma in livertissue 
of a patient comprising: 

assessing the level of expression of one or more marker 
genes in a liver tissue sample from the patient, which marker genes 
are differentially expressed in HBxAg(+] cells as compared with 
HBxAgf-] cells, a reduction in the level of expression of said one or 
more marker genes in the sample as compared to the expression 
level in noncancerous liver tissue being indicative of hepatocellular 
carcinoma in the sample. 

2. The method according to claim 1 wherein the one or more 
marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes under conditions of high stringency to a nucleic 
acid probe selected from the group consisting of SEQ ID NO:t, SEQ lb 
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9. and SEQ ID NO:10. 

3. The method according^ claim 1 wherein the marker gene is 
selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO:27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-sur, 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
the human fetal heart gene, the cDNA of which is identified 
as GenBank accession number AA047006. 
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4. The method according to claim 3 wherein the marker gene is 
hu-su/f . 

5. The method of claim 1 wherein the step of assessing the 
expression of said one or more marker genes comprises contacting said 

5 sample with one or more probes which detect mRNA which is differentially 
expressed in HBxAgl+J cells as compared with HBxAg[-J cells. 

6. The method of claim 5 wherein the mRNA detected hybridizes 
under high stringency conditions to a nucleic acid probe having a 
nucleotide sequence selected from the group consisting of SEQ ID NO:1, 

10 SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 T SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, and SEQ ID NO:10. 

7. The method of claim 1 wherein the step of assessing the 
expression of said one or more marker genes comprises detecting marker 
protein encoded be said one or more marker genes. 

is 8. A method according to claim 7 wherein the one or more 

marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes under high stringency conditions to a nucleic 
acid probe selected from the group consisting of SEQ ID NO:1, SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID 

2 0 NO:7, SEQ ID NO:8, SEQ ID NO:9. and SEQ ID NO:10. 

9. The method according to claim 7 wherein the marker gene is 
selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO:27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
25 a gene which encodes the polypeptide of SEQ ID NQ:29; 
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a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-sur, 

human tubulin-folding cofactor E gene; 
human myeloblast KIAA0132 gene; and 
the human fetal heart gene, the cDNA of which is identified 
as GenBank accession number AA047006. 

10. The method of claim 7 wherein the one or more marker 
proteins are detected by contacting the sample with one or more antibodies 
which bind said marker proteins. 



11- A method according to claim 10 wherein the marker 
hu-suil. 



gene is 



12, A method for diagnosing hepatocellular carcinoma 
comprising: 

obtaining a liver tissue sample from a patient; 

assessing the level of expression of one or more marker 
genes in the sample, which marker genes are differentially 
expressed in HBxAg{+J ceils as compared with HBxAg[-] cells, a 
reduction in the level of expression of said one or more marker 
genes in the sample as compared to the expression level in 
noncancerous liver tissue being indicative of hepatocellular 
carcinoma in the sample. 

13. The method according to claim 12 wherein the one or more 
marker genes is selected from the group of genes expressing an RNA 
transcript which hybridizes to a nucleic acid probe selected from the group 
consisting of SEQ ID NO 1, SEQ ID NO:2, SEQ ID NO:3. SEQ .D NQ 4 
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SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7 ? SEQ ID NO;8, SEQ ID NO:9, 
and SEQ ID NO: 10. 

14. The method according to claim 12 wherein the marker gene 
is selected from the group consisting of 

a gene which encodes the polypeptide of SEQ ID NO:27; 
a gene which encodes the polypeptide of SEQ ID NO:28; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
a gene which encodes the polypeptide of SEQ ID NO:29; 
hu-su/; 

human tubulin-folding cofactor E gene; 
human myeloblast KIAAG132 gene; and 
the human fetal heart gene, the cDNA of which is identified 
as GenBank accession number AA0470Q6. 

1 5. The method according to claim 14 wherein the marker gene 
is hu-suil. 

16. A method for identifying alterations In gene expression 
patterns in a biological sample that are induced by hepatitis B x antigen 
comprising the steps of 

obtaining a biological sample; 

contacting said sample with a probe which detects an mRNA 
which is differentially expressed in HBxAg[+] cells as compared with 
HBxAgH cells; arid 

detecting expression of a gene encoding said mRNA detected 
by said probe. 

17. The method of claim 16 wherein the biological sample is liver 

tissue. 
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18. The method of claim 17 wherein the liver tissue comprises ; 
hepatocellular carcinoma. 



19- A method for identifying patterns in gene expression in a 
biological sample that are altered by hepatitis B x antigen comprising the 
steps of 

obtaining a biological sample; 

contacting said sample with an antibody which detects a 
protein which is differentially expressed in HBxAgf+] cells as compared with 
HBxAgf-] cells; and 

detecting expression of a gene encoding said protein 
detected by said antibody. 

20. The method of claim 1 9 Wherein the biological sample is liver 

tissue. 

21 . The method of claim 20 wherein the liver tissue comprises a 
hepatocellular carcinoma. 
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<110> Feitelson, Mark 

<120> Molecular Markers for Diagnosing Hepatocellular 
Carcinoma 

<130> 36seq 

<140> Not Yet Assigned 
<141> 1999-01-15 

<i50> 60/072/938 
<15.1> 1998-01-29 

<160> 26 

<170> Patent In Veir. 2.0 

<210> 1 
<2I1> 488 
<212> DNA 

<213> Homo sapiens 
<400> L 

tgtccccact cttcaaagcc aagatggtag ctgccatccc tggggagcct ggaaccaggc 60 
aatgttcggg ggaggcaggg gacaggctgg aaectggtga agtcttaaag taaactcctc 120 
ctatcggggt gtagaaggga atctgttaat caaacagagc aatattagaa aggctacaga 180 
ggtcaactca gtggaacacg gttctcccaa acagattttg taattccgaa aatccacgca 240 
tgcgcaaaca tacgcataca ctcccatgtt cctggacagt ttatagctac cataacctgg 300 
cattttccaa aacataccat gtagactctt ggatacacaa ggtaatttta gragceacatt 360 
acgatgaacc ttttaaaaag ttatcattta tttttatntt cccccactgg ctgtattata 420 
agacaatttt tatatgtgat atgtatttac cttagtgtgt taaataaaca cnggcattcc 480 
ctaaaaaa 488 



<210> 2 

<211> 90 

<212> DNA 

<213> Homo sapiens 



<400> 2 

acctgcccgg gcggccgctc gagccctata gtgagtcgta ttagaggccc ttgtagcgtg 60 
aagacgacag aaagggcgtg gtgcggaggg 90 

<210> 3 
<211> 360 
<212> DNA 

<213> Homo sapiens 



1 
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<400> 3 

tgaaaaccaa actggcggga tggaagcaga ttattctgcc atttttccag gtetttqagt 60 
tgcacgtca.a atctggggct gatcacccca cacttgttta gcctgcctgt gaggttcaca 120 
acaattttcc cagctctgtg gtcatcaatg atttcaaatt cgccaatgta accatgcttc 180 
atcatcacag tgagaaaccg gacgatgact ttggagcacg gcctaataag cacctggcgt 240 
ttlT.T" tttC9gCatt «"g.t.ctc ttgagagcat ctgccaggac attcat^cgc 300 
accattgtgg cggcgcggaa aacctgcccg ggcggccgct cgaaatccat atgactagta 360 

<210> 4 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 4 

acctgttgag gcacttttgt ttcttgggca aaaatacagt ccaatggaga gtatcattgt 60 
ttttgtaccg ccctccgcac cacgccctaa 



90 

<210> 5 

<211> 199 

<212> DNA 

<213> Homo sapiens 

<400> 5 

tacgggaagg cgaagaaaag aatagagaag atagggaaat tagaagataa aaacatactt 60 
taZ! 9aaa aaagataaat "aaacctga aaagtaggaa gcagaagaaa aaagacaagc 120 
taggaaacaa aaagctaagg gcaaaatgta ctgaaaatca agatcaagcg agcttttgcc 180 
crttctgctcc acgggaggt 

199 

<210> 6 

<211> 658 

<212> DNA 

<213> Homo sapiens 

<4 0O> 6 



tcagggaaag ccttcacaga tcagtcaata aatacggtgc catgggatgt gccttgcaca 60 
ccacgggcac tcacatcttg aatgctggtc cactggaggc ccttgggttg ggcgggagca 120 
aggcctactt ctgcttcctc aggacaactt ccccacctct gtcctgggac cacctgcccg 80 
ctata 9 ca C o 9 W.egct«ct cccactccag gggccagtga cagagagcag 240 

cttotctccc f CCC r CCC gCag9atCCt tgagacagaa caaactgctg 300 

cttgtctccc taccctgggg gctgtgatat tcttggtaac atctctgagc tggtctgtga 360 
ggtcacttcc tcttttaaca ctgttgagga gactccaaac cctctgtctt ctgctcgtct 420 
tctcatgtcg attgggcacc agccattctc aggcaccaga gcacagcccc acacgggtgc 480 
^ ^otgaac acagcagcct cctacacctg aactgggttt ctctgcaclc 540 
tcacagccgt ctcaccagct caatgagctg ctggatgttt ttgttttggt tcgacaagcc 600 
gttcctgatg ttttccagta ggcatctctt caattcaaat atggcttcac tgtaagcc 658 



<210> 7 
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<212> DNA 

<213> Homo sapiens 



<4Q0> ? 

tcgaggcggt accgggnncc gatttgtagc gtgaanacga oagaaaaggc gtggtgcgga 60 
gggcggtgta gcgtgaagaq gacagaaagg gcgtggcgga gggcggtacc taccctttcc i20 
accctgacgg ggagtgctgc cccgtgtgcc gagactgcaa ctacgaggga aggaaggtgg 180 
cgaatggcca ggtgttcacc ttggatgatg aaccctgcac ccggtgcacg tgteagctgg 240 
gaagaggtga gctgtgagaa Sgttccctgc cagcgggcct gtgccgaccc tgccctgctt 300 
cctggggact gctgctcttc ctgircdagat tccetgtetc ctctggaaga aaagcagggg 360 
ctctcccctc acggaaatgt ggcattcage aaagctggtc ggagcctgca tggaagacac 420 
tgaggcccct gtcaactgta gctcctgtcc tgggcccccg acagcatcac cctcgaaggc 480 
cggtgcttca tctcctccag ctccttttaa gaacgaactt gatgaaaaca cagaetttac 540 
ctaeaagccc ggcaggagct catggtccac actcactcgc tttggggctg acagccactt 600 

<210> 8 
<211> 532 
<212> DNA 

<213> Homo sapiens 
<4 00> 8 

aagccccccc ctttcccttn tttttcttnc ccaanngggg natnccggcc cnttnggntg 60 
aaccaantta aennatc.ccc naaanatggt nccccnntgg aataanccnt gggttactnt 120 
ccaaaccaat tttaccccta tttactttaa atggaattaa ancctccctn tttcattttt 180 
aaagggatca nggtgaaaat cccnttgnaa aaccccccna accaaaancc bcttaaantt 240 
nattttccct tccccgggaa cttncnaacc cngtaaaaaa anaaanaang gtttccncnn 300 
aaatncnttt tttccgcccn ctttatatcg ncgccttngn aaaananaaa aaancccccc 360 
nccccccggg tggtttttnt ncccggattt aaaaancccc nntctttttt ccaaaggttt 420 
ccgghtnccc ccaaaacncc aanacccncn ntcggncccc ccttttttgc cccngntnng 4 80 
ggcccccncc naaaaatctt ntccccccnc cntcatcncn cgctngnnaa tn 532 



<210> 9 
<211> 425 
<212> DNA 

<213> Homo sapiens 
<400> 9 

gctttttttt tttttttttt tttttttttt 
aagggtttaa tacattacac ataacattaa 
gtttgttact tcacatggca ttgggcagct 
gctacatgac tgatggatca gtttgagatt 
gaaggttggc ctcacattot gatgtttgga 
cagacetttg tggcaagcca gatgtcctat 
agctctgtcc acctagtcag gttggaaaca 
aatag 

<210> 10 



txtatttctg aaaacaagtt ttatttaaat 60 
aactgaaggg gaaaaaaaaa ccaaaaacca 120 
gctgctatta agttgcaagc tctacagcta 180 
tgttcccttg tcaaaagttt aactctgata 24 0 
catcccttag ctaggatatg tctggtcgaa 300 
cacctcccta gcggtaagag ggcctctttg 360 
ccaggggatc taccacaaaa gctcccttct 420 

4 25 
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<212> DMA 

<213> Homo sapiens 

<400> 10 

rr 9ao ° 9 " s °^ « 

cttgtagcgt gaaaacnaca oaaaao^Li- " raCCCQ tatagtgagt cgtattaggc 180 
ctatagtgag tcgtattanq ccctara,,^ = «. Z ! c 9 aca 9aacc cgcncgagcc 300 

ag 9 gagtcg 5 ™^ ^™ ™,, 9 ^ 36 » 

gtattangcc ctatagtgag tcqtattaca fJLZ* * ta ^ ta 9ccet atagtgagtc 4 20 
tgagtcgta.tr tangccctat £££££ attacoccct T ^ccctatag 4 80 

ctatagtgag tcctattagg ccttgt^cg t a" ^ 
ttaaaaatcc atatgactaa tnnatr^^ agccctatan ttgagtcgta 600 

cctawnt tg agtcg taS I,™," * r tMS " **-«-• W0 

693 

<210> 11 

<211> 3607 

<212> DNA 

<213> Homo sapiens 

<4O0> 11 

™ t g c c a 9 tzztiit ™ g 9 :: :~ tg " agaa9t - c — «> 

.ggagatgat l g l g l g £l 120 
tccaacagtg gagagaagca ggcttcaaga aa'tgctg"' £££££ r^*"* "° 
ccaggtccag aaactcagcc tccaggactc tctgcaaatc a^t^? tagagacgta 240 

ctgggeccct caggttccca aagacttgcc ctggaatt^ T£££ 3 °° 
caatgctgat gccaggaata ccactatggt gctLcata rtc^T t9Cag9CCCt 360 
ggagaaggag agccagatgg aagaggagat catctacto! CtCCC39aCg cca ^cctgt 420 
tgccgacatt tattcctttt ctLlll "tctactgg gacccagctg atgaccttgc 480 
ccttctctgt gccctStgc tlllTclll ca^tT T —ttaga 540 

aatggccctc tgccagtttg cactc™^ "f"^ 9 ca *<=aagaaa tagcgttgaa 600 
tacatttctg ctgtgggcc! ZaoTaZll ^ctcggaga accactacca 660 

^catggg 9 j£Z£ I~ £~ ™* ™ 

cgtgcgcatg gacgtcagta gcaactCcL g^ccagctt otllt " CCt ^ Ctt 780 
gggccacagg cagtgggact gcttctggca !cgggacct c ^ tCCtCI * CCC 840 

ggagatttcg gatgggttgg tagaaatttc ctggtttttt «Z CC3atgCCCg 900 
ggacattttc ccagaaccto ta fl rrf^,f' ^ttttt eccagcggaa gggaggactt 960 

***** ,. gc \ c : t c ;: ^™ ::x 9 ; ssss r clc T 9 1020 

toct g t g <:t g r..«ac«= llltllT" a 9 atttct 9 a a t aa g tt„t l20 0 

c-a g «tc 9 t 9 " a " t J" 9 O"" 9 "'" « g .c. g c g , 1260 

99t «« gtg ga93 , 9 :.t 99 = : 9 s t: 11 ~£ g c :i~ r 9c,99c9 1320 

« 9t9 . gg . 9 tgtcogaa , g „„, g „ c9 ^ « - : 9 =9 a 380 
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ctcggatgcc tacagaaggg acgagctgag gctgcagggg gacccctgga gaaaggcagc 1500. 

ccaagtggag aaggagttct gccagctcca gtgggccgtg gacccccctg agaagcacag 1560 

ggctgagctg aggcggcggc tgctagaact tcgaatgcag cagaacggcc atgattcctc 1620 

ctcgggggtg caggagttca tctcggggat cagcagcccc tccttgagtg agaagcagta 1680 

cttcctgagg tggatggagt ggggcctggc acgggtggcc cagccgcgac tgagacagcc 174 0 

tccggagacg cttctcaccc tgagaccaaa gcacgggggc aqcacagacg tgggggagcc 1800 

gctctggcct gagcccctag gggtggaaca cttcttgcgg gagatgggac agttttatga 1860 

ggctgagagc tgtcttg.tgg aggcagggag gctgccggca ggccagaggc gttttgccca 1920 

cttcccaggc ttggcotcgg agctgctgct gacagggctg cctctggagc taatcgatgg 1980 

gagcacgctg agcatgcccg tccgctgggt cacagggctc ctgaaggagc tgcacgtccg 204 0 

actggagaga cggtcaaggc tggtggttct gtcaaccgtc ggggtgccag gcacgggcaa 2100 

gtccacactc ctcaacacca tgtttgggct gcggtttgcc acagggaaga gctgcggtcc 2160 

tcgaggggcc ttcatgcagc tcatcacagt ggctgagggc ttcagccagg acctgggctg 2220 

tgaccacatc ctggtgatag actccggggg cttgataggt ggggcdttga cgtcagctgg 22?0 

ggacagattt gagctggagg cttccttggc cactetgctc atgggactga gcaatgtcac 2340 

cgtgatcagt ctagctgaaa ccaaggacat tccagcagct attctgcatg catttctgag 2400 

gttagaaaaa acggggcaca tgccgaacta ccagtttgta taccagaacc ttcatgatgt 2 4 60 

atctgttccc ggccctaggc ccagagacaa gagacagctc ctggatccac ctggtgacct 2520 

gagcagggct gcagcccaga tggagaaaca gggcgacggc ttccgggcac tggcaggcct 2580 

ggccttctgc gaccctgaga agcagcacat ctggcacatc ccaggcctgt ggcacggagc 2 64 0 

acctcecatg gccgcagtga gcttggccta cagtgaagcc ataitttgaat tgaagagatg 2700 

cctactcgaa aacaltcagga acggcttgtc gaaccaaaac aaaaacatcc agcagctcat 2760 

tgagctggtg agacggctgt gagtgtgcag agaaacccag ttcaggtgta ggaggctgct 2820 

gtgggcagcc ctgtctgatg gggcacccgt gtggggctgt gctctggtgc ctgagaatgg 2880 

ctggtgccca atcgacatga gaagacgagc agaagacaga gggtttggag tctcctc.aac 2 940 

agtgttaaaa gaggaagtga cctcacagac cagctcagag atgttaccaa gaatatcaca 3000 

gcccccaggg tagggagaca agcagcagtt tgttctgtct cagctcctgt caaggatcct 3060 

gcggggtggg ccctctgtat agctgctctc tgtcactggc ccctggagtg ggagcagcgt 3120 

ccttagtcac tgcaggccca ggcgggcagg tggtcccagg acagaggtgg ggaagttgtc 3180 

ctgaggaagc agaagtaggc cttgctcccg cccaacccaa gggcctccag tggaccagca 3240 

ttcaagatgt gagtgcccgt ggtgtgcaag gcactcccat ggcaccgtat ttattgactg 3300 

atctgtgaag gcttccctga cccctgccca ggaagagttc actggtcgct ctgttgtgcc 3360 

ccacagcact ttgttatacc tctgccacac acttcacgca gcgcgttgta actcatgtgt 3420 

ttacatgtct gtccccccag actgtgagqt ccttgagggc agggactgta cattctccag 3480 

ctctgtgtcc ccagggcctg gcacattgta gacgcttaat aaatgtctgt taaatgaaaa 354 0 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3600 
aaaaaag 3607 

<210> 12 

<211> 753 

<212> DNA 

<213> Homo sapiens 



<400> 12 

cttaccgaca gacagacgct gggacccacg 
agccctgcgc ggggcagggg gtctggaacc 
tgctgagcct gtgcttcctg agaacagcag 
tccttggfccc catctacctc ctcttcatcc 



acgacagaag gcgccgatgg ccgcctgctg 60 
agacagagcc tgaacctgcc gccaccagcc 120 
gggtctgggt accccccatg tacctctggg 180 
accaccatgg ccggggctac ctccggatgt 24 0 
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ccccactctt caaagccaag atggtagctg ccatccctgg gagcctggaa ccaggcaatg 300 
ttcgggggag gcaggggaca ggctggaacc tggtgaagtc ttaaagtaga ctcctectat 360 
cggggtgtag aagggaatct gttaatcaaa cagagcaata ttagaaaggc tacagaggtc 420 
aactcagtgg aacatggttc tcccaaacag attttgtaat tccgaaaatc cacgcatgcg 4 80 
caaacatacg catacactcc catgttcctg gacagtttat agctaccata acctggcatt 540 
ttccaaaaca taccatgtag actcttggat acacaaggta attttagggc cacattagga 60O 
tgaacctttt aaaaggttat gcatttattH ttatgtttcc ccactagctg tattatagga 660 
caatttttat atgtgatatg tatttacctt agtgtgttaa ataaacactg gcatttcaaa 720 
aaaaaaaaaa aaaaaaaaaa aaagcggccg ctg 753 

<210> 13 
<211> 2251 
<212> DNA 

<2l3> Homo sapiens 
<400> 13 

gaattctagg gggaacctca gacccccctc actccttcag ggggaggtga tgggacccct 60 
tcctcaccca ggggccctga gtccccccga ctggcagcag ggccctctcc ctgctggcac 120 
ctgggagcca tgcatgaatc aaggagtcgc tggacagagc ctgggtgttc ccagtgctgg 180 
tgcgaggacg ggaaggtgac ctgtgaaaag gtgaggtgtg aagctgcttg ttcccaccca 24 0 
attccctcca gagatggtgg gtgctgccca tcgtgcacag gctgttttca cagtggtgtc 300 
gtccgagctg aaggggatgt gttttcacct cccaatgaga actgcaccgt ctgtgtctgt 360 
ctggctggaa acgtgtcctg catctctcct gagtgtcctt ctggcccctg tcagaccccc 420 
ccacagacgg attgctgtac ttgtgttcca gtgagatgct atttccacgg ccggtggtac 4 80 
gcagacgggg ctgtgttcag tgggggtggt gacgagtgta ccacctgtgt ttgccagaat 54 0 
ggggaggtgg agtgctcctt catgccctgc cctgagctgg cctgcccccg agaagagtgg 600 
cggctgggcc ctgggcagtg ttgcttcaec tgccaggagc ccacaccctc gacaggctgc 660 
tctcttgacg acaacggggt tgagtttccg attggacaga tctggtcgcc tggtgacccc 720 
tgtgagttat gcatctgcca ggcagatggc tcggtgagct gcaagaggac agactgtgtg 780 
gactcctgcc ctcacccgat ccggatccct ggacagtgct gcccagactg ttcagcaggc 84 0 
tgcacctaca caggcagaat cttctataac aacgagacct tcccgtctgt gctggaccca 900 
tgtctgagct gcatctgcct gctgggctca gtggcctgtt cccccgtgga ctgccccatc 960 
acctgtacct accctttcca ccctgacggg gagtgctgcc ccgtgtgccg agactgcaac 1020 
tacgagggaa ggaaggtggc gaatggccag gtgttcacct tggatgatga accctgcacc 1080 
cggtgcacgt gccagctggg agaggtgagc tgtgagaagg ttpcctgcca gcgggcctgt 1140 
gccgaccctg ccctgcttcc tggggactgc tgctcttcct gtccagattg ccctgtctcc 1200 
tctggaagaa aagcagggge tctcccctca cggaaaatgt ggcattcagc aaagctggtc 1260 
gggagcctgc atggagacac tgaggcccct gtcaactgta gctcctgtcc tgggcccccg 1320 
acagcatcat cctcgaggcc ggtgcttcat ctcctccagc tccttttaag aacgaacttg 1380 
atgaaaacac agactttacc tacaagcccg gcaggagctc atggtccaca ctcactcgct 14 40 
ttggggctga cagccacttt cccaggggag cctggggcct cccctcgact ctcaccaggg 1500 
ccttcgaccc ctccaggagc ccccactcta cctctagctt ccccaggggc tcctcagcca 1560 
cctcctgtga ctccagagcg ctcgttctca gcctctgggg cccagatagt gtccaggtgg 1620 
cctcctctgc ctggcaccct cctgacggaa gcttcagcac tttccatgat ggaccccagc 1680 
ccctcgaaga cccccatcac cctcctcggg cctcgcgtgc tttctcccac cacctctaga 1740 
ctctccacag cccttgcagc caccacccac cctggccccc agcagccccc agtgggggct 1800 
tctcgggggg aagagtccac catgtaagga ggtcactgtg tccgggagac tctggagaga 18 60 
ggacctctgc cagtggccca gggtgtgtgc agggcacctc caaggatgaa cctggtgggg 1920 



6 



WO 99/39200 PCT/US99/01894 

atgcctgggc tccctcctgc acgggccctg gtgaggatgg aagaccccca aggctggatg 1980 

taaccttgtt cccaagaagt gtttggaatg tgctgtaaga atggaggaag tcgtttccac 2040 

tgtcagcatc ctcccctgga ccgcgtggct ggctcatctt ttgagaaggg ttgggactgc 2100 

caagttttcc tggaggaaga gttgcgtccg gctgggattc cactcactgg gactgtaccg 2160 

ccaggtgtca tgcgtctttc tgaggtttcc tgattaaagg ttgtttcggt ttcctaaaaa 2220 
aaaaaaaaaa aaaaaaaaaa aaaaagcggc c 2251 

<210> 14 . 

<211> 541 

<212> DNA 

<213> Homo sapiens 

<406> i4 

gccacaatgg tgcgcatgaa tgtcctggca gatgctetca agagtatcaa caatgccgaa 60 
aagagaggca aacgccaggt gcttattagg ccgtgctcca aagtcatcgt ccggtttctc 120 
actgtgatga tgaagcatgg ttacattggc gaatttgaaa tcattgatga ccacagagct 180 
gggaaaattg ttgtgaacct cacaggcagg ctaaacaagt gtggggtgat aagccccaga 240 
tttgacgtgc aactcaaaga cctggaaaaa tggcagaata atctgcttcc atcccgccag 300 
tttggtttca ttgtactgac aacctcagct ggcaticatgg accatgaaga agcaagacga 360 
aaacacacag gagggaaaat cctgggattc tttttctagg gatgtaatac atatatttac 420 
aaataaaatg cctcatggac tctggtgctt ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4 80 
aaaaaaaaaa aaaaaaaaaa aaaaattaaa aaaaaaaaaa aaaaaaaaaa aaaaagcggc 540 
c 541 

<210> 15 
<211> 660 
<212> DNA 

<213> Homo sapiens 
<400> 15 

gccgccgycg aggattcagc agcctccccc ttgagccccc tcgcttcccg acgttccgtt 60 
cccccctgcc cgccttctcc cgccaccgcc gccgccgcct tccgcagccg tttccaccga 120 
ggaaaaggaa tcgtatcgta tgtccgctat ccagaacctc cactctttcg acccctttgc 180 
tgatgcaagt aagggtgatg acctgcttcc tgctggcact gaggattata tccatataag 240 
aattcaacag agaaa.cggca ggaagaccct tactactgtc caagggatcg ctgatgatta 300 
cgataaaaag aaactagtga aggcgtttaa gaaaaagttt gcctgcaatg gtactgtaat 360 
tgagcatccg gaatatggag aagtaattca gctacagggt gaccaacgca agaacatatg 420 
ccagttcctc gtagagattg gactggctaa ggacgatcag ctgaaggttc atgggtttta 480 
agtigcttgtg gctcactgaa gcttaagtga gga.tttcctt gcaatgagta gaatttccct 540 
tctctccctt gtcacaggtt taaaaacctc acagcttgta taatgtaacc atttggggtc 600 
cgcttttaac ttggactagt gtaactcctt catgcaataa actgaaaaga gccatgcaaa 660 

<210> 16 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 16 
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<212> PRT 

<213> Homo sapiens 

<400> 20 

Cys Gly Val Trp Asn Gin Thr Gla Pro Glu Pro Ala Ala Thr Ser 
1 5 10 15 



<210> 21 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 21 

His His His Gly Arg Gly Tyr Leu Arg Met Ser Pro Leu Phe iys Cys 
1 5 10 15 



<210> 22 

<2li> 16 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Pro Cys Pro Glu Leu Ala Cys Pro Arg Glu Glu Trp Arg Leu Gly Pro 
1 5 10 15 



<210> 23 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Asp Pro Ser Arg Ser Pro His Ser Thr Ser Ser Phe Pro Arg Gly Ser 



10 15 



Ser Ala Thr Ser Cys Asp Ser Arg 
20 



<210> 24 

<211> 21 

<212> PRT 

<213> Homo sapiens 



<400> 24 

His Pro Pro Asp Gly Ser Phe Ser Thr Phe His Asp Gly Pro Gin Pro 
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1 5 10 15 

Leu Glu Asp Pro Cys 
20 



<210> 25 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Lys Ser lie Asn Asn Ala Glu Lys Arg Gly Lys Arg Cys 
15 10 

<210> 26 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<4 00> 2 6 

Asp His Glu Glu Arg Arg Arg Lys His Thr Gly Gly Lys Cys 
1 5 10 
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